A More Powerful Two-Sample Test in High Dimensions using Random Projection

Authors

  • Miles Lopes
  • Laurent Jacob
  • Martin J. Wainwright
Abstract

We consider the hypothesis testing problem of detecting a shift between the means of two multivariate normal distributions in the high-dimensional setting, allowing for the data dimension p to exceed the sample size n. Our contribution is a new test statistic for the two-sample test of means that integrates a random projection with the classical Hotelling T² statistic. Working within a high-dimensional framework that allows (p, n) → ∞, we first derive an asymptotic power function for our test, and then provide sufficient conditions for it to achieve greater power than other state-of-the-art tests. Using ROC curves generated from simulated data, we demonstrate superior performance against competing tests in the parameter regimes anticipated by our theoretical results. Lastly, we illustrate an advantage of our procedure with comparisons on a high-dimensional gene expression dataset involving the discrimination of different types of cancer.
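
The idea of combining a random projection with Hotelling's T² can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `projected_hotelling_t2`, the Gaussian projection matrix, the pooled covariance estimate, and the projected dimension `k` chosen by the caller are all assumptions made for illustration, and the abstract does not specify any of them.

```python
import numpy as np

def projected_hotelling_t2(x, y, k, rng):
    """Hotelling T^2 computed after projecting both samples from p to k dimensions.

    x: (n1, p) array, y: (n2, p) array.
    k: projected dimension (assumed here to satisfy k <= n1 + n2 - 2).
    rng: a numpy.random.Generator used to draw the projection.
    """
    n1, p = x.shape
    n2 = y.shape[0]
    # Assumption: Gaussian random projection, drawn independently of the data.
    proj = rng.standard_normal((p, k))
    xk, yk = x @ proj, y @ proj
    diff = xk.mean(axis=0) - yk.mean(axis=0)
    # Pooled sample covariance in the projected k-dimensional space.
    pooled = ((n1 - 1) * np.cov(xk, rowvar=False)
              + (n2 - 1) * np.cov(yk, rowvar=False)) / (n1 + n2 - 2)
    pooled = np.atleast_2d(pooled)  # keeps the k = 1 case two-dimensional
    return (n1 * n2) / (n1 + n2) * diff @ np.linalg.solve(pooled, diff)

# Simulated data with p > n, where the unprojected sample covariance is singular.
data_rng = np.random.default_rng(0)
x = data_rng.standard_normal((30, 200))
y = data_rng.standard_normal((30, 200)) + 0.5  # mean shift in every coordinate
t2 = projected_hotelling_t2(x, y, k=10, rng=np.random.default_rng(1))
print(t2)
```

Because the projected covariance is only k × k, it stays invertible even when p exceeds the sample size, which is the situation where the classical statistic is undefined.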


Similar resources

RAPTT: An Exact Two-Sample Test in High Dimensions Using Random Projections

In high dimensions, the classical Hotelling’s T² test tends to have low power or becomes undefined due to singularity of the sample covariance matrix. In this paper, this problem is overcome by projecting the data matrix onto lower dimensional subspaces through multiplication by random matrices. We propose RAPTT (RAndom Projection T-Test), an exact test for equality of means of two normal pop...


Evaluating E-Learning Maturity from the viewpoints of Medical Sciences Students

Introduction: Digitalization of education is considered a major reform in higher education. E-learning programs are increasingly seen as a way to reform medical sciences education, giving access to ongoing learning and training without time or geographical barriers. Technology is a powerful tool for effective teaching and deep learning. Therefore, the aim of this paper is evalua...


Small Sample Size in High Dimensional Space - Minimum Distance Based Classification

In this paper we present some new results concerning classification in the small-sample, high-dimensional case. We discuss geometric properties of data structures in high dimensions. It is known that such data form an almost regular simplex in high dimensions, even if the covariance structure of the data is not unity. We restrict our attention to two-class discrimination problems. It is assumed that ob...


Predicting High School Students' Test Anxiety Based on Their Perfectionism Dimensions

Abstract: The present study aimed to predict high school students' test anxiety based on perfectionism dimensions. The population of the study included junior high school students of humanistic sciences, science and mathematics in Tabriz. The sample consisted of 168 people who were selected by cluster random sampling. The Spielberger Anxiety Test and the Multidimensional Perfectionism...


The Relationship Between Metaphors and Eysenck's Introversion/Extraversion Personality Dimensions

The goal of the present research was to determine the relationship between Eysenck's "E" personality dimensions and a selection of metaphorical concepts. Past research has emphasized personality and linguistic components in literal language. The present research investigated metaphor as a part of figurative language, and its relation to the two personality dimensions. The initial sa...



Publication date: 2011